Predicting Software Outcomes Using Data Mining and Text Mining

نویسندگان

  • Uzma Raja
  • Marietta J. Tretter
چکیده

Organizations spend a major portion of their Information Technology budget on software maintenance. In this paper, we present a predictive model for the maintenance outcomes of the software projects. We also identify the factors that affect software maintenance outcomes. We build our model using Data Mining (DM) techniques on Open Source Software (OSS) project data. We use the public access to the data archives of over 100,000 projects hosted by SourceForge.net (SF). We use prior research in software engineering to identify the initial set of variables used in the model building process. We use multiple DM techniques available in SAS® Enterprise MinerTM. We also create additional new variables from the textual data provided by SF, through SAS® Text Miner. The use of these new variables improves the model, significantly. The final model is selected based on domain knowledge and fit statistics of the models. Results indicate that end-user participation, product functionality, and usefulness of the project affect the software maintenance quality.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Designing a System for Trend Analysis of Users in Website Surfing in Iran Using Data Mining and Text Mining Algorithms

Background and Aim: As of the entrance of web surfing to the lifestyle of a vast majority of people in the society and the need for a more accurate social and cultural policy making in the field, authors intended to analyze the behavior of the society users in viewing different websites so as to help politicians and practitioners. Methods: Design science research method is used in this research...

متن کامل

Predicting Type2 Diabetes Using Data Mining Algorithms

Background and purpose: Today, information systems and databases are widely used and in order to achieve higher accuracy and speed in making diagnosis, preventing the diseases, and choosing treatments they should be merged with traditional methods. This study aimed at presenting an accurate system for diagnosis of diabetes using data mining and a heuristic method combining neural network and pa...

متن کامل

Predicting OSS Development Success: A Data Mining Approach

Open Source Software (OSS) has reached new levels of sophistication and acceptance by users and commercial software vendors. This research creates tests and validates a model for predicting successful development of OSS projects. Widely available archival data was used for OSS projects from Sourceforge. net. The data is analyzed with multiple Data Mining techniques. Initially three competing mo...

متن کامل

Data, text and web mining for business intelligence: a survey

The Information and Communication Technologies revolution brought a digital world with huge amounts of data available. Enterprises use mining technologies to search vast amounts of data for vital insight and knowledge. Mining tools such as data mining, text mining, and web mining are used to find hidden knowledge in large databases or the Internet. Mining tools are automated software tools used...

متن کامل

Predicting Bankruptcy of Companies using Data Mining Models and Comparing the Results with Z Altman Model

One of the issues helping make investment decisions is appropriate tools and models to evaluate financial situation 0f the organization.  By means of these tools, investors can analyze financial situation of the organization and identify financial distress or an ideal condition, they become aware of making decisions to invest in appropriate conditions.  The main objective of this study is to ev...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007